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ABSTRACT 

Guinea fowl coronavirus (GfCoV) causes fulminating enteritis that can result in 
a daily death rate of 20% in guinea fowl flocks. Here we studied GfCoV diversity 
and evaluated its phenotypic consequences. Over the period 2014-2016, 
affected guinea fowl flocks were sampled in France and avian coronavirus 
presence was confirmed by PCR on intestinal content and 
immunohistochemistry of intestinal tissue. Sequencing revealed 89% amino 
acid identity between the viral attachment protein S1 of GfCoV/2014 and the 
previously identified GfCoV/2011. To study the receptor interactions as a 
determinant for tropism and pathogenicity, recombinant S1 proteins were 
produced and analyzed by glycan and tissue arrays. Glycan array analysis 
revealed that viral attachment S1 proteins from GfCoV/2014 and GfCoV/2011 
can, in addition to the previously elucidated biantennary diLacNAc receptor, 
bind to glycans capped with alpha 2,6-linked sialic acids. Interestingly, 
recombinant GfCoV/2014-S1 has an increased affinity for these glycans 
compared to GfCoV/2011-S1, which was in agreement with the increased 
avidity of GfCoV/2014-S1 for gastrointestinal tract tissues. Enzymatic removal 
of receptors from tissues before applying spike proteins confirmed the 
specificity of S1 tissue binding. Overall, we demonstrate that diversity in 
GfCoV 51 proteins results in differences in glycan and tissue binding 


properties. 


IMPORTANCE Avian coronaviruses cause major global problems in the poultry 
industry. As causative agents of huge economical losses, the detection and 


understanding of the molecular determinants of viral tropism is of ultimate 
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importance. Here we set out to study those parameters and obtained in-depth insight 
in the virus-host interactions of guinea fowl coronavirus (GfCoV). Our data indicate 
that diversity in GfCoV viral attachment proteins result in differences in affinity for 
glycan receptors, as well as altered avidity for intestinal tract tissues, which might 


have consequences for its tissue tropism and pathogenesis in guinea fowls. 


INTRODUCTION 

Avian coronaviruses (AvCoV) pose a major threat to poultry health, production and 
welfare worldwide. AvCoVs are highly infectious, remain endemic in poultry 
populations and, due to their high mutation rate, frequently produce new antigenic 
variants (1, 2). The best-known AvCoV is infectious bronchitis virus (IBV), causing 
mainly respiratory disease in chickens. In addition, IBV-like viruses have been 
detected in other domestic poultry, including turkey and quail (3-5). In guinea fowl, 
coronaviruses have been identified for the first time in 2011 as the causative agent 
for fulminating enteritis(6). Full genome sequencing revealed that guinea fowl 
coronavirus GfCoV/FR/2011 is closely associated with turkey coronavirus (TCoV) (7), 
both causing gastrointestinal tract infections in their respective host (6, 8). Clinical 
signs related to GfCoV infection in guinea fowl include prostration, ruffled feathers, 
decreased water and feed consumption, and have resulted in a daily death rate up to 
20% in several farms in France (6). Upon necropsy, whitish and enlarged pancreases 
were consistently reported. Histopathological analyses revealed pancreatic necrosis 
and lesions of various intensities in the intestinal epithelium, with most severe lesions 


found in the duodenum of affected animals (6). 
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Genetic classification of AvCoVs is based on phylogenetic analysis of the S1 domain 
of its viral attachment protein spike (2). The spike protein is the main determinant for 
tropism (9), and the N-terminal part of the S1 of IBV has been shown to contain the 
receptor-binding domain (RBD) (10). Studies using recombinant IBV-S1 and/or -RBD 
proteins have demonstrated that the viral tropism is reflected by tissue binding of 
such proteins (11). Mutations in the spike proteins of IBV might either result in 
decreased (10) or increased (12) avidity for its receptor present on epithelial cells of 
the chicken trachea. In contrast to IBV, GfCoV and TCoV target the epithelial cells of 
the gastrointestinal tract (4, 6), and recombinant protein binding of their S1 proteins 
reflects this viral tropism, with predominant staining of epithelial cells of the small 
intestine (4). Glycan array analysis identified elongated LacNAcs on branched N- 
glycans as the host receptor for enteric AvCoVs, which are abundantly expressed on 


intestinal tissues (4). 


Clinical symptoms in guinea fowl similar to those reported in 2011 are continuously 
reported by veterinarians in France (personal communication). However, studies on 
newly emerging GfCoVs are particularly hampered by the lack of models to grow the 
virus. More specifically, susceptible cell lines have not yet been identified, inoculation 
of embryonated guinea fowl eggs did not result in GfCoV production (data not 


shown), and SPF guinea fowls are not available for experimental infection. 


Here we set out to study the consequences of GfCoV genetic diversity for glycan and 
tissue interactions. We revealed that the GfCoV spike gene from the 2014-2016 
outbreak in guinea fowl flocks in France was 89% identical to that of GfCoV/2011 (7). 


Glycan and tissue binding analyses of GfCoV/2011 and GfCoV/2014 recombinant 
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spike S1 revealed that, while both proteins had the same specificity, GfCoV/2014-S1 
had a much higher affinity toward glycan receptors and tissues of the lower 
gastrointestinal tract, in agreement with the observed replication of the virus in these 
tissues from field cases. Taken together we demonstrate GfCoV diversity results in 


phenotypically different receptor binding properties. 


RESULTS 

Lesions and coronaviral protein expression in the gastrointestinal tract of 
diseased guinea fowls between 2014-2016. 

Fulminating disease (peracute enteritis) in guinea flocks continued to be reported 
after the initial outbreak of GfCoV infection in 2011 (6). Between February 2014 and 
November 2016, duodena from 29 diseased guinea fowls were collected and 
analyzed for lesions and coronaviral protein expression. Histological analysis of 
tissues by H&E staining revealed lesions in all duodena, with clear infiltration of 
inflammatory cells in remnants of the villi (Fig. 1, black arrowheads). For seven 
animals the entire gastrointestinal tract was available for histological analysis, 
showing lesions across the entire length of the intestinal tract, including the colon 
(Fig 1, black arrowheads). Viral protein expression using antibodies against the M 
protein of avian coronaviruses was observed in all duodena and four out of the seven 
lower intestinal tracts by immunohistochemistry (Fig. 1, white arrowheads). In the 
colons devoid of expression of viral proteins, the infiltration of inflammatory cells was 


noted, suggestive of a previous exposure to a virologic agent. 


In contrast to what we observed, virus replication of GfCoV/2011 appeared to be 


restricted to the duodenum (6). Unfortunately, we were unable to confirm the lack of 
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infection of lower gastrointestinal tract samples in the previous outbreak due to 
unavailability of samples. Nevertheless, we here hypothesize that genetically 
divergent GfCoVs might have caused phenotypic differences in guinea fowls over the 


years. 


Circulation of genetically diverse GfCoV. 

Gastrointestinal content collected from twenty affected animals between February 
2014 and November 2016 were analyzed for the presence of gammacoronavirus 
genetic material by one-step real-time RT-PCR using pan-gammacoronavirus 
primers (13). For all samples, Ct values obtained were below 35 (data not shown), 
confirming the presence of coronaviral RNA in all tested samples (Table 1). Next, 
overlapping conventional PCRs were performed with primers based on the spike 
gene of the GfCoV/2011 virus (sequences available upon request). Partial S1 
sequences could be obtained from ten out of twenty RT-PCR positive samples (893- 
1641በሆ 3624nt for complete S, Table1), the quality and/or quantity of the remaining 
ten samples was too low to generate PCR products. Sanger sequencing of the 
obtained fragments confirmed the presence of GfCoV in the intestinal content of all 


ten birds, confirming continuous GfCoV circulation in France. 


Phylogenetic analysis was performed to investigate the genetic diversity of the 
obtained partial S1 sequences using Maximum likelihood analyses (Fig. 2). The 
results showed that the 2014/2016 sequences clearly clustered with the S1 reference 
gene from GfCoV/2011 (NCBI HF544506) supported by a bootstrap value of 100, 


while they were genetically more distantly related to TCoV. Each of the GfCoV- 
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2014/2016 partial S1 sequences shared 84-90% nt identity with GfCoV/2011 and 


between the 2014-2016 partial S1 sequences the variation was 0.1 to 8.0%. 


Only from one sample a full spike sequence could be obtained (yCoV/AvCoV/guinea 
fowl/France/14032/2014; NCBI MG765535), while for the others the amount and/or 
quality of the viral RNA samples were too low for further analyses. Comparison of the 
S1 gene of GfCoV/2014 with that of GfCoV/2011 using the Kimura 2-parameter 
distance model indicated that the genes had an 85% nucleotide and 89% amino acid 
sequence identity. Alignment of the amino acid sequences did not indicate clear 
mutation hotspots (data not shown) and the huge sequence diversity with IBV-M41- 
S1 (the only avian coronavirus for which a cryo-EM structure has been elucidated 


(14)) impairs further suggestions on the implications of each of the mutations. 


GfCoV/2014-S1 recognizes the enteric coronavirus diLacNAc glycan receptor 
with higher affinity than GfCoV/2011-S1. 

Using the glycan array of the Consortium for Functional Glycomics, we previously 
determined that S1 from GfCoV/2011 specifically binds to the diLacNAc glycan 
receptors (Gal_1,4GIcNAc_1,3Gal_1,4GIcNAc) (4). To study whether the observed 
changes in the spike of GfCoV/2014 resulted in differences in recognition of this 
glycan receptor, we recombinantly produced GfCoV/2014-S1 and GfCoV/2011-S1 
and applied both proteins to diLacNAc-PAA conjugates in an ELISA as previously 
described (4). At similar protein concentrations GfCoV/2014-S1 showed improved 
binding to this receptor (Fig. 3), indicating that the mutations in S1 did not affect the 


specificity, but resulted in significant higher affinity, for this particular receptor. 
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The genetic differences between GfCoV/2014 and /2011 did not alter glycan 
specificity. 

Next, we investigated whether the mutations in S1 resulted in recognition of 
additional N-linked glycans. To this end, both S1 proteins were applied to a novel 
glycan array containing N-glycan structures with their linear counterparts, either with 
terminal galactose or two differently linked sialic acid moieties (F. Broszeit and R.P. 
de Vries, submitted for publication). Schematic representations of each of the 
glycans are given in Fig. 4A. The data revealed that both GfCoV-S1 proteins bind to 
longer biantennary LacNAc structures (Fig. 4B, structures #3-4), including the 
diLacNAc structure used in the ELISA (Fig. 3). Furthermore, both GfCoV-S1 proteins 
bound to longer linear LacNAc repeats (Fig. 4B, structure #1), which were not 
included in the previous array (4). Finally, both GfCoV-S1 proteins bound longer 
linear and biantennary LacNAc repeats with terminal alpha 2,6 sialic acid (Fig. 4B, 
structures #9-12), but not those capped with alpha 2,3 linked sialic acids (Fig. 4B, 
structures #5-8). Erythrina cristagalli lectin (ECA), Sambucus nigra lectin (SNA) and 
Maackia Amurensis Lectin | (MAL1) were taken along as controls. We observed as 
expected specific binding to galactose, alpha 2,6 linked and alpha 2,3 linked sialic 
acid terminal glycans, respectively (Fig. 4C). In conclusion, both GfCoV-S1 proteins 
show specificity for the same glycans, ending with either galactose or alpha 2,6 
linked sialic acids on the glycan array. However, the relative fluorescence observed 
for GfCoV/2014-S1 was consistently higher when compared to GfCoV/2011-S1, 
which is suggestive for differences in affinity for glycan receptors, as was observed 


for diLacNAcs in Fig. 3. 


199በ6 Aq eLOz ‘yı yoe uo /640”ህ|85|ለሆ/ጀ991ህ wos pepeojuwoq 


ዐን 
= 
6 
O 
ዐን 
=+ 
N 
(9) 
A- 
ሓሸ - 
o. 
1 
Q 
ህን 
=) 
(= 
= 
389] 
(45) 
-- 
ር... 
O 
ርጋ 
እው) 
< 


Journal of Virology 


Journal of Virology 


189 


190 


194 


GfCoV/2014-S1 has a higher affinity for glycan receptors compared to /2011-51. 
To allow comparison of the binding affinities of both proteins for each glycan, we 
applied fivefold serial S1 protein dilutions onto the glycan array and compared 
binding intensities at various scan powers. At each concentration, for all glycans 
shown in Figure 4A, binding signals of GfCoV/2014-S1 (Fig. 5A) were consistently 
higher than those of GfCoV/2011-S1 (Fig. 5B). Detection of linear glycan binding 
(glycan #1 and #9) required higher concentrations and scan powers compared to the 
detection of biantennary LacNAc structures (glycans #3-4 and #11-12) for both 
proteins. Interestingly, binding intensity of GfCoV/2011-S1 to glycans with terminal 
alpha 2,6 sialic acids was less compared to binding to glycans with terminal 
galactose (Fig. 5B compare structures #3-4 to #11-12 in 100 ug/mL to 20 ug/mL). 
This difference in preference for galactose-terminal glycans was not observed for 
GfCoV/2014-S1, since binding to glycan structures #3-4 and #11-12 was similar in 
each dilution applied to the array (Fig. 5A). Taken together, the data indicate that 
GfCoV/2014-S1 has a higher affinity for all glycans bound on the array compared to 


/2011-S1. 


GfCoV/2014-S1 has a broader gastrointestinal tract tropism. To reveal whether 
the observed differences in glycan binding properties of the S1 proteins have 
biological consequences for tissue tropism, we first determined whether the identified 
glycans are indeed present on gastrointestinal tract tissues of healthy, uninfected 
guinea fowl. Both SNA and ECA lectins stained the epithelial lining of the duodenum, 
jejunum and caecum intensely, while intermediate staining of the proventriculus and 
colon was observed. In the pancreas only limited binding of SNA was observed, with 


no staining by ECA; in contrast, in the ileum ECA strongly bound whereas SNA 
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bound only to a limited extend. In conclusion, all tissues of the gastrointestinal tract, 


except cloaca, express GfCoV glycan receptors (Table 2) (15). 


Next, we investigated the binding patterns of GfCoV-S1 proteins to gastrointestinal 
tissues. Both proteins stained the epithelial cells of almost the entire gastrointestinal 
tract (duodenum and colon in Fig. 6 1% column; summary of results in Table 2), 
indicating that receptors present on the tissues allow binding of S1. Interestingly, 
staining intensities of the lower intestinal tract (ileum, caecum, colon) were much 
more apparent for GfCoV/2014-S1 than for GfCoV/2011-S1. This prompted us to 
analyze avidity and specificity to glycan receptors in the guinea fowl gastrointestinal 
tissues by GfCoV-S1 proteins. We therefore pre-treated tissue slides with 
Arthrobacter ureafaciens neuraminidase (AUNA) and/or galactosidase to cleave off 
terminal sialic acids and galactose residues from host glycans, respectively. 
Treatment of the tissues with AUNA had only a minor effect on the binding of both 
GfCoV-S1, with a slight decrease in binding intensity to the duodenum for 
GfCoV/2014-S1 (Fig. 6A 2"" column; Table 2). SNA lectin binding was completely 
abolished after pre-treatment with AUNA, confirming that the treatment did effectively 


cleave off all sialic acids from the host glycans (Table 2). 


When galactose residues were removed from the tissues by treatment with 
galactosidase prior to applying ECA, binding was severely reduced or totally absent 
(Table 2). Binding of GfCoV/2011-S1 to the tissue was completely abolished (Fig. 6 
2" column; Table 2), indicating that GÍCoV tissue engagement is almost exclusively 
dependent on the presence of galactose-terminating glycans. On the other hand, 


GfCoV/2014-S1 still clearly bound to the epithelial cells of the intestinal tract, 


10 
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indicating a significant difference in receptor binding avidity (Fig. 6 3"” column; Table 


2). 


Finally, tissues were simultaneously pre-treated with AUNA and galactosidase to 
remove both galactose and sialic acids from the glycans of the host. Indeed binding 
of both ECA and SNA were strongly reduced (Table 2). Tissue binding of 
GfCoV/2011-S1 was completely prevented, while GfCoV/2014-S1 still clearly bound 
to the epithelial cells of the gastrointestinal tract (except pancreas) (Fig. 6 4" column; 
Table 2). These results suggest that either a minor amount of receptors is still 
present, or that yet an additional (glycan) receptor is involved in tissue binding of 


GfCoV/2014-S1. 


DISCUSSION 

In this study we demonstrated ongoing GfCoV circulation in guinea fowl flocks in 
France. The sequence diversity between the viral attachment proteins of GfCoV 
circulating in 2011 and 2014 resulted in differences in receptor binding properties 
with profound phenotypic consequences. This relationship between these findings 
and in vivo pathogenesis can, however, only be elucidated in detail when new 


models to study this virus have been developed. 


An amino acid sequence identity of 89% between viruses circulating only several 
years apart might indicate suggest that either a novel GfCoV strain was introduced in 
France from a yet unidentified source, or that there was high evolutionary pressure 
on the 2011 GfCoV strain. High mutation rates for avian coronaviruses are not 


uncommon (based on full genome sequences around 1.2x10? substitution/site/year 


11 
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(16, 17)). When comparing GfCoV/2011 and /2014-S1 sequences, the calculated 
mutation rate was 5x10° substitution/site/year with a dN/dS ratio of 0.45. Similar 
mutation rates of the spike have been reported for IBV (18) and are believed to be 
driven by selective pressure after vaccination (19, 20). However, no vaccine is 
available against GfCoV, nor against the closely related turkey coronavirus, TCoV. 
Another driver for genetic diversity is the population size (21), however, this is 
unlikely to explain the observed fast mutation rate of GfCoV since flocks are 
considerably smaller compared to chicken flocks. It might well be that circulating 
antibodies against field strains of GfCoV are main drivers of the observed sequence 
diversity. Unfortunately, retrospective studies to further elucidate the contribution of 
virus evolution, the circulation of other virus populations in the last years, or 
introduction of novel strains via for example trade of birds between farms, are 


impossible due to the lack of archive material. 


Here we revealed a novel glycan receptor for GfCoV, the first coronavirus that binds 
N-glycans capped with alpha 2,6 linked sialic acids. Alpha 2,6 sialic acid presence 
has been reported previously in guinea fowl large intestine (15), as well as the 
previously elucidated poly-LacNAc expressed in guinea fowl small intestine (4). 
Together, their expression patterns can explain in large part the tropism of GfCoV, 
but it does not exclude, together with the results presented in this manuscript, that 
yet another host factor plays a role in GfCoV/2014 infection. Initial attempts to show 
whether protein receptors, required for infection of many other coronaviruses (22-24), 


are required were yet unsuccessful (data not shown). 


12 


199በ6 Aq eLOz ‘yı yoe uo /610 use iAl//.d3u wos pepeojuwoq 


ዐን 
=> 
ô 
19) 
ዐን 
=+ 
N 
(9) 
ር. 
ሓሸ - 
o. 
1 
Q 
ህን 
=) 
(= 
= 
389] 
(45) 
-- 
ር... 
O 
ርጋ 
እው) 
< 


Journal of Virology 


Journal of Virology 


While spike protein binding analyses suggest phenotypic differences between these 
viruses in vivo, the reported gross clinical signs in field cases between 2011 and 
2014 were not markedly different. Attempts to study the pathogenesis of GfCoV/2014 
by inoculating commercial guinea fowls with GfCoV-containing fecal samples did, 
unfortunately and in contrast to a previous study (6), not result in manifestations of 
clinical signs or convincing detection of viral RNA by RT-QPCR (data not shown). 
Whether this was due to previous exposure of commercial birds to GfCoV and hence 


circulating antibodies preventing the infection remains to be investigated. 


Here, we have demonstrated that GfCoV/2014-S1 has higher affinity for glycan 
receptors and increased avidity for the lower gastrointestinal tract compared to 
GfCoV/2011-S1. The viral genetic diversity between these spikes and the 
implications for receptor recognition further add to our understanding of this virus for 


which models are basically lacking. 


MATERIAL AND METHODS 

Collection of field samples. Samples were collected from guinea fowls showing 
enteritis and concomitant high mortality (21096) in flocks in five regions in France 
(Bretagne, Pays de Loire, Nouvelle-Aquitaine, Occitanie, and Auvergne-Rhóne- 
Alpes) from February 2014 through November 2016. Gastrointestinal content was 


collected and stored at -80°C for viral RNA isolation. Tissues (duodenum, pancreas, 


airsac, lung, ‘small intestine’, large intestine, kidney, cloaca, trachea and bursa) were 
collected during necropsy, fixed for 24h in 496 buffered formaldehyde (m/v) and 


stored in 7096 ethanol. 
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Immunohistochemistry. Paraffin-embedded tissues were sliced at 4um and 
deparaffinized in xylene and rehydrated in an ethanol gradient from 100%-70%. 
Antigen retrieval was carried out in Tris-EDTA pH 9,0 (preheated) before applying 
1% H202 in methanol. After washing twice in Normal antibody diluent (Immunologic) 
mAb mouse anti IBV M protein 25.1 (Prionics, Lelystad, The Netherlands), cross 
reacting with TCoV and GfCoV (5) was applied for 1 hour at room temperature (RT). 
Slides were washed in PBS-0,1%Tween and EnVision kit (cat. no. K4001; Dako) was 
used for anti-mouse secondary antibody staining according to the manufacturers 
protocol. Slides were washed three times in PBS and viral M-protein presence was 
visualized with AEC. The tissues were counterstained with hematoxylin and mounted 


with AquaMount (Merck). 


Molecular characterization of GfCoV. The gastrointestinal content collected from 
affected guinea fowl was clarified by centrifugation (30 sec at 11.000xg), and RNA 
was extracted using a Qiagen Viral RNA extraction kit following the instructions of the 
manufacturer. A one-step real-time RT-PCR targeting the avian coronavirus N-gene 
was carried out to confirm the presence of coronavirus RNA as previously described 
(13). Subsequently, the isolated RNA was reverse transcribed using the Revertaid kit 
with random hexamers (Thermo Fisher, Waltham, MA), and overlapping conventional 
PCRs were performed to amplify the guinea fowl S-gene (primer sequences available 
upon request). Sanger sequencing of the resulting fragments was performed using 
PCR primers. Contigs were generated with BioEdit (version 7.0.8.0) (25) and 
submitted to NCBI. Muscle (26) was used for the alignment, and Mega (version 6.06) 


with bootstrap value of 1000 for the phylogeny (27). Selective pressure was 
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calculated as dN/dS, and the dN=dS hypothesis was tested using Pamilo-Bianchi-Li 


method (28) with a p<0.05 considered statistically significant. 


Construction of the expression vector. The codon-optimized sequence for 
GfCoV/2014-S1 (yCoV/AvCoV/guinea fowl/France/14032/2014; NCBI MG765535), 
containing an upstream Nhel and downstream Pacl restriction site, was obtained 
from GenScript and cloned into the pCD5 expression vector by restriction digestion 
(as previously described (29)). The S1 sequence is in frame with a C-terminal GCN4 
trimerization motif and Strep-Tag. The expression vector encoding GfCoV/2011-S1 


was generated previously (29). 


Production of recombinant proteins. Recombinant S1 proteins were expressed by 
transfection of human embryonic kidney (HEK293T) cells with pCD5-expression 
vectors using polyethylenimine (PEI) at a 1:12 (w/w) ratio. Cell culture supernatants 
were harvested after six days. The recombinant proteins were purified using Strep- 


Tactin sepharose beads as previously described (29). 


ELISA. Gal 1,4GIcNAc 1,3Gal 1,4GIcNAc (Consortium for Functional Glycomics), 
was coated in a 96-well maxisorp plate (NUNC, Sigma-Aldrich) at 0.5 gg/well 
overnight at 4'C, followed by blocking with 3% BSA (Sigma) in PBS-0.1% Tween. S1 
proteins were pre-incubated with Strep-Tactin HRPO (1:200) for 30 minutes on ice. 
For each protein, 2-fold dilutions were made in triplicate in PBS, and applied onto the 
coated well, followed by incubation for 2 hours at room temperature. TMB (3,3’,5,5’- 
tetramethylbenzidine, Thermo Scientific) substrate was used to visualize binding, 


after which the reaction was terminated using 1M H2SO4A. Optical densities 
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(OD450nm) were measured in FLUOstar Omega (BMG Labtech), and MARS Data 
Analysis Software was used for data analysis. Statistical analysis was performed 


using a 2-way ANOVA. 


Glycan array. Glycan structures were printed in six replicates on glass slides 
(NEXTERION® Slide H, Schott Inc.). Prelabeled S1-proteins with Alexa647-linked 
anti-Strep-tag mouse antibody and with Alexa647-linked anti-mouse IgG (4:2:1 molar 
ratio) were applied to the slides (concentrations in figure legends) and incubated for 
90 minutes, after which the slides were washed with PBS and deionized water, dried 
and imaged immediately. 

As controls different lectins were applied: Erythrina cristagalli agglutinin (ECA), which 
is specific for glycans with terminal galactose, N-acetylgalactosamine, or lactose and 
Sambuca nigra agglutinin (SNA) and Maackia Amurensis Lectin | (MAL1) which are 
specific for alpha 2,6 linked and alpha 2,3 linked sialic acids attached to terminal 
galactose respectively. Of the six replicates, the highest and lowest value were 
removed, and of the remaining four the total signal and SD values were calculated 


and plotted in bar graphs or heatmaps. 


Spike histochemistry. Spike histochemistry was performed as previously described 
(29). S1 proteins pre-complexed with Streptactin-HRPO were applied onto 4 um 
sections of formalin-fixed paraffin embedded healthy guinea fowl tissues and binding 
was visualized using 3-amino-9-ethyl-carbazole (AEC; Sigma-Aldrich). Proteins were 
applied onto slides at 5 ug/ml. Where indicated the tissues were treated per slide with 


40ህ B-galactosidase (Gal; Megazyme, USA) or 2 mU Neuraminidase (Sialidase) 
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from Arthrobacter ureafaciens (AUNA, Sigma, Germany) in 10 mM potassium 


acetate, 2,5 mg/ml TritonX100, pH 4.2 at 40°C O/N before protein application. 


Lectin histochemistry. Lectin histochemistry was performed as previously 
described (4). Biotinylated-Erythrina cristagalli lectin or Biotinylated- Sambucus nigra 
lectin (both Vector Laboratories) were diluted in PBS to a final concentration of 2 
ug/ml (ECA) or 6 ug/ml (SNA) and applied to healthy guinea fowl tissue sections for 
30 min. After washing with PBS the signal was visualized by an Avidin-Biotin 


complex (ABC kit; Vector Laboratories) and counterstained with hematoxylin. 


Data availability. Contigs are available in GenBank under accession numbers 


MG765535 to MG765542 and MK290733 to MK290734. 
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FIGURE LEGENDS 


Figure 1. (Immuno)histological analyses of guinea fowl intestinal tract. 


Representative images of duodenum and colon from a guinea fowl presented with 


peracute enteritis in 2014 after staining with H&E (left) or antibodies against the M 


protein of infectious bronchitis virus, known to cross react with GfCoV-M protein in 


immunohistochemistry (IHC, right). Black arrowheads indicate inflammatory cells and 


white arrowheads indicate viral protein expression. 
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Figure 2. Molecular phylogenetic analysis by Maximum Likelihood method 
comparing GfCoV (partial) spike sequences. Phylogenetic tree based on the 
Kimura 2-parameter model, in which bootstrap values are shown next to the 
branches. The analysis involved 22 nucleotide sequences. All positions containing 
gaps and missing data were eliminated. There were a total of 893 nucleotide 
positions in the final dataset. Evolutionary analyses were conducted in MEGAG. * 


indicate partial S1 sequences of GfCoV. 


Figure 3. Binding of GfCoV-S1 to the enteric coronavirus glycan receptor 
diLacNAc. Concentration-dependent binding of GfCoV-S1 proteins to 
Gal_1,4GIcNAc_1,3Gal_1,4GIcNAc in ELISA. As negative control, IBV-M41-NTD 
was taken along (10); 1: significant difference between GfCoV-S1 and IBV-M41, 2: 


significant difference between GfCoV/2014-S1 and GfCoV/2011-S1 (p<0.001). 


Figure 4. Glycan binding specificity of guinea fowl S1 proteins. Schematic 
representation of selected glycan structures present on the glycan array; numbers 
correspond to those shown in the graphs (A). Number 1-4 represent glycans ending 
with galactose, number 5-8 glycans capped with alpha 2,3 linked sialic acids, number 
9-12, glycans capped with alpha 2,6 linked sialic acids. Glycan receptor specificity of 
GfCoV-S1 proteins (B) and lectins ECA, MAL1 and SNA (C) in glycan array assay (F. 
Broszeit and R.P. de Vries, submitted for publication); RFU: relative fluorescent units; 
yellow circle: galactose, blue square: GIcNAc, green circle: mannose, pink diamond: 


NeuAc. 
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Figure 5. Glycan binding affinity of guinea fowl S1 proteins. Glycan binding of 
GfCoV/2014-S1 (A) and GfCoV/2011-S1 (B) are shown as heatmaps with 5-fold 
dilutions (100 g/mL to 4 pg/mL) of the proteins applied to glycan array slides that 
are scanned with different laser intensities. RFU: relative fluorescent units; glycan 


numbers correspond to schematic representations shown in Figure 4A. 


Figure 6. Binding of GfCoV-S1 proteins to guinea fowl duodenum and colon 
without and with enzymatic pretreatment of the tissues. Spike histochemistry 
was performed on uninfected, healthy duodenum (A) and colon (B) tissues without 
and with pre-treatment of enzymes (AUNA and/or galactosidase) before applying 
GfCoV/2014-S1 and GfCoV/2011-S1. Binding of proteins was visualized by red 


staining. 


TABLE 1 Overview of selected guinea fowls and obtained GfCoV spike sequences. 


Animals with bold animal numbers were included for immunohistological examination 


as well. *ND = unknown 


TABLE 2 Relative binding of viral proteins and lectins on guinea fowl intestinal 


tissues. 


White box indicates no visible staining, light grey box indicates light to mild staining 
and/or not all epithelial cells show staining, dark grey indicates intense staining, most 


of the epithelial cells are showing positive signal. na = not analyzed 
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AUNA 


Galactosidase 
A) Duodenum 
GfCoV/2014-S1 
GfCoV/2011-S1 
GfCoV/2014-S1 
GfCoV/2011-S1 ' 


B) Colon 
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TABLE 1 Overview of selected guinea fowls and obtained GfCoV spike sequences. 
Animals with bold animal numbers were included for immunohistological examination as 
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well 
Animal Date of Age at Accession Nt identity (96) 
number sample sampling number: with 
collection time (Spike sequence) | GfCoV/2011-S1 
(week / year) (weeks) 
2011 LN610099.1 100 
nt: 1-3708 
14-002 6/2014 10 
14-013 15/2014 8 
14-032 22 / 2014 7 MG765535 85 
nt: 1-3669 
14-036 24 / 2014 7 
14-037 25 / 2014 7 
14-039 26 / 2014 5.5 
14-040 23 / 2014 ND* MG765536 88 
nt: 1-1392 
14-041 23 / 2014 ND* MG765537 88 
> nt: 1-1771 
= 14-042 23 / 2014 ND* MG765538 88 
= nt: 1-1392 
5 14-047 33 / 2014 3 MG765539 88 
5 nt: 1-1378 
3 14-053 37 / 2014 9 MG765540 88 
^ nt: 1-1393 
14-065 44 / 2014 12 
14-066 45 / 2014 4 MG765541 88 
nt: 1-1384 
15-006 3/2015 ND* MG765542 87 
nt: 1-980 
15-116 46 / 2015 7 
15-118 47 / 2015 8 
16-086 38 / 2016 ND* MK290733 85 
nt: 1-2465 
16-115 45 / 2016 4 
16-123 47 / 2016 ND* MK290734 86 
nt: 571-1895 


*ND = unknown 
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TABLE 2 Relative binding of viral proteins and lectins on guinea fowl intestinal 


tissues. 


|. s1 SEE s1 >> -.. NET 


Treatment | AUNA 
Galactosidase 


Tissue proventiculus 
duodenum 
pancreas 


jejunum 
ileum 


ceacum 
colon 


cloaca 


White box indicates no visible staining, light grey box indicates light to mild staining and/or not all epithelial cells 
show staining, dark grey indicates intense staining, most of the epithelial cells are showing positive signal. na = 


not analyzed 
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